Implementing Radix Sort on Emu 1
نویسندگان
چکیده
This paper discusses the implementation of radix sort on Emu Solutions’ Emu 1 system. Sorting is an important building block for a large class of data-intensive applications. Radix sort, in particular, is a good candidate for sorting large sequences of fixed-length integer keys (for example, indexes in a database), because it presents lower computational complexity with respect to comparison based sorting algorithms when the length of the keys is small with respect to the dataset size. In general, implementing scalable sorting algorithms for large datasets is challenging, due to the large amount of memory operations with low data locality to move keys to their new positions. The Emu architecture is a memory-centric design based around the concept of lightweight threads that migrate across nodelets, a combination of memory and multithreaded processors aimed at providing a scalable large-memory system. We show how the Emu 1 design provides scalability in performance and size to our radix sort implementation.
منابع مشابه
A Fast Radix Sort
Almost all computers regularly sort data. Many different sort algorithms have therefore been proposed, and the properties of these algorithms studied in great detail. It is known that no sort algorithm based on key comparisons can sort N keys in less than O(N\og/V) operations, and that many perform 0(N) operations in the worst case. The radix sort has the attractive feature that it can sort N k...
متن کاملThe Effect of Local Sort on Parallel Sorting Algorithms
We show the importance of sequential sorting in the context of in memory parallel sorting of large data sets of 64 bit keys. First, we analyze several sequential strategies like Straight Insertion, Quick sort, Radix sort and CC-Radix sort. As a consequence of the analysis, we propose a new algorithm that we call Sequential Counting Split Radix sort, SCS-Radix sort. SCS-Radix sort is a combinati...
متن کاملConscious Radix Sort
The exploitation of data locality in parallel computers is paramount to reduce the memory traac and communication among processing nodes. We focus on the exploitation of locality by Parallel Radix sort. The original Parallel Radix sort has several communication steps in which one sorting key may have to visit several processing nodes. In response to this, we propose a reorganization of Radix so...
متن کاملModified Pure Radix Sort for Large Heterogeneous Data Set
We have proposed a Modified Pure Radix Sort for Large Heterogeneous Data Set. In this research paper we discuss the problems of radix sort, brief study of previous works of radix sort & present new modified pure radix sort algorithm for large heterogeneous data set. We try to optimize all related problems of radix sort through this algorithm. This algorithm works on the Technology of Distribute...
متن کاملFast radix sort for sparse linear algebra on GPU
Fast sorting is an important step in many parallel algorithms, which require data ranking, ordering or partitioning. Parallel sorting is a widely researched subject, and many algorithms were developed in the past. In this paper, the focus is on implementing highly efficient sorting routines for the sparse linear algebra operations, such as parallel sparse matrix matrix multiplication, or factor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015